Research Article | Open Access
Volume 2023 |Article ID 0113 | https://doi.org/10.34133/plantphenomics.0113

GenoDrawing: An Autoencoder Framework for Image Prediction from SNP Markers

Federico Jurado-Ruiz,1 David Rousseau,2 Juan A. Botía,3 Maria José Aranzana 1,4

1Center for Research in Agricultural Genomics (CRAG), 08193 Barcelona, Cerdanyola, Spain
2Université d’Angers, LARIS, INRAe UMR IRHS, 49000 Angers, France
3Department of Information and Communi-cation Engineering, University of Murcia, 30071 Murcia, Spain
4IRTA (Institut de Recerca i Tecnologia Agroalimentàries), Barcelona, Spain

Received 
04 Jul 2023
Accepted 
23 Oct 2023
Published
03 Nov 2023

Abstract

Advancements in genome sequencing have facilitated whole-genome characterization of numerous plant species, providing an abundance of genotypic data for genomic analysis. Genomic selection and neural networks (NNs), particularly deep learning, have been developed to predict complex traits from dense genotypic data. Autoencoders, an NN model to extract features from images in an unsupervised manner, has proven to be useful for plant phenotyping. This study introduces an autoencoder framework, GenoDrawing, for predicting and retrieving apple images from a low-depth single-nucleotide polymorphism (SNP) array, potentially useful in predicting traits that are difficult to define. GenoDrawing demonstrates proficiency in its task using a small dataset of shape-related SNPs. Results indicate that the use of SNPs associated with visual traits has substantial impact on the generated images, consistent with biological interpretation. While using substantial SNPs is crucial, incorporating additional, unrelated SNPs results in performance degradation for simple NN architectures that cannot easily identify the most important inputs. The proposed GenoDrawing method is a practical framework for exploring genomic prediction in fruit tree phenotyping, particularly beneficial for small to medium breeding companies to predict economically substantial heritable traits. Although GenoDrawing has limitations, it sets the groundwork for future research in image prediction from genomic markers. Future studies should focus on using stronger models for image reproduction, SNP information extraction, and dataset balance in terms of phenotypes for more precise outcomes.

© 2019-2023   Plant Phenomics. All rights Reserved.  ISSN 2643-6515.

Back to top